Abstract

Choosing a uniformly sampled simple directed graph realization of a degree sequence has 
many applications, in particular in social networks where self-loops are commonly not allowed. 
It has been shown in the past that one can perform a Markov chain arc-switching algorithm to 
sample a simple directed graph uniformly by performing two types of switches: a 2-switch and a 
directed 3-cycle reorientation. This paper discusses under what circumstances a directed 3-cycle 
reorientation is required. In particular, the class of degree sequences where this is required is 
a subclass of the directed 3-cycle anchored degree sequences. An important implication of this 
result is a reduced Markov chain algorithm that uses only 2-switches. 

1 Introduction 

Markov chain Monte Carlo algorithms have been used successfully to uniformly sample realizations
of both undirected and directed degree sequences [?]. The algorithms use a sequence of moves from
a move-set to go from one realization to another. This results in a random walk on a meta-graph,
where each vertex corresponds to a realization and the edges connecting these vertices correspond
to moves from the move-set. If the meta-graph is connected and its edges carry appropriate
probability weights (see [?]), the walk is guaranteed to converge to a uniformly sampled realization
with the fixed degree sequence.

To sample simple directed realizations (i.e. no self-loops or multi-arcs), there are two types of
moves in our move-set [?]: a 2-switch and the reorientation of a directed 3-cycle $C_3$, where $C_3$ has
vertex set $\{v_1, v_2, v_3\}$ and arc set $\{(v_1, v_2), (v_2, v_3), (v_3, v_1)\}$. A 2-switch is given by



[Diagram: a 2-switch.]




where dotted lines denote no arcs, with the $C_3$ reorientation given by

[Diagram: a $C_3$ reorientation.]

$C_3$ reorientations can lead to a much larger mixing time in certain circumstances. In this paper we
identify the cases where $C_3$ reorientations are necessary, give a degree-sequence characterization of
these cases, and show that we can reduce our move-set to only 2-switches.
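
The two moves described above can be sketched in a few lines of Python. This is only an illustration: the arc-set representation and the function names `two_switch` and `reorient_3cycle` are assumptions made here, not part of the paper.

```python
from typing import FrozenSet, Optional, Tuple

Arc = Tuple[int, int]
Arcs = FrozenSet[Arc]


def two_switch(arcs: Arcs, a: int, b: int, c: int, d: int) -> Optional[Arcs]:
    """Replace arcs (a, b), (c, d) with (a, d), (c, b); return None if not allowed."""
    if len({a, b, c, d}) < 4:
        return None  # the move acts on four distinct vertices
    if (a, b) not in arcs or (c, d) not in arcs:
        return None  # both arcs to be removed must exist
    if (a, d) in arcs or (c, b) in arcs:
        return None  # a new arc already present would create a multi-arc
    return (arcs - {(a, b), (c, d)}) | {(a, d), (c, b)}


def reorient_3cycle(arcs: Arcs, v1: int, v2: int, v3: int) -> Optional[Arcs]:
    """Reverse the directed 3-cycle v1 -> v2 -> v3 -> v1; return None if not allowed."""
    forward = {(v1, v2), (v2, v3), (v3, v1)}
    backward = {(v2, v1), (v3, v2), (v1, v3)}
    if len({v1, v2, v3}) < 3 or not forward <= arcs or backward & arcs:
        return None  # requires an induced directed 3-cycle (no opposite arcs)
    return (arcs - forward) | backward
```

Both functions leave every in-degree and out-degree unchanged, which is why they map realizations of a degree sequence to realizations of the same sequence.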

Recently, Berger and Müller-Hannemann posted a paper with similar results. In [1], they
rediscover the result by Rao et al. [?] proving connectivity of the meta-graph using 2-switches and
$C_3$ reorientations. They also implement a Monte Carlo algorithm similar to that of Rao et al. [?], which
uniformly samples simple realizations from a directed degree sequence, and include mixing time
calculations. They show, as we do, that the special cases where $C_3$ reorientations are required
are precisely the subset of $C_3$-anchored digraphs which we call $C_3^{*}$-anchored. Our paper differs in
that our proof is substantially shorter (built upon the structural characterization of $C_3$-anchored
digraphs found in [?]) and uses the degree sequence characterization of the $C_3$-anchored digraphs to
identify the $C_3$-anchors, as opposed to a more computationally intensive algorithm that requires
knowledge of all induced 3-cycles of a given realization. Using the degree sequence characterization
is much faster (linear in the number of vertices) and allows us to use the more efficient 2-switch
random walk. We both, however, show that the meta-graph consists of $2^k$ isomorphic subgraphs,
where $k$ is the number of anchored 3-cycles.

2 Notation 

All directed graphs in this article will be simple, i.e. with no self-loops or multi-arcs. We consider
integer-pair sequences $d = \{(d_i^+, d_i^-)\}_{i=1}^N$ and say $d$ is digraphic if there exists a digraph (i.e. directed
graph) with degree sequence $d$, denoting the set of digraph realizations of $d$ by $R(d)$. All integer-pair
sequences are assumed to be digraphic (otherwise $R(d) = \emptyset$), and $d^+$ and $d^-$ will denote the
out-degree and in-degree sequences of $d$, respectively.

We denote directed graphs by $G$, with $V(G)$ the vertex set and $A(G)$ the arc set. We will drop
the reference to $G$ when the digraph is understood, for example through the notation $G = (V, A)$.
An arc from vertex $a$ to vertex $b$ will be denoted by $(a, b)$, with the orientation given by the ordering.

Given a digraph $G = (V, A)$ and vertex sets $X, Y \subseteq V$, we define the subgraph
$G[X, Y] = (X \cup Y, A[X, Y])$, where $A[X, Y] = \{(x, y) \in A : x \in X \text{ and } y \in Y\}$. When $X = Y$, we have the
usual definition of an induced subgraph and will denote this by $G[X]$.

We will use the vertex labeling notation $v_i$ in place of $L^{-1}(i)$, where $L$ is a bijective labeling
function $L : V \to \{1, \dots, |V|\}$ going from vertices to coordinates of the degree sequence.

3 Result 

Given a degree sequence $d$, we define the meta-graph $\Omega_d = (\mathcal{V}, \mathcal{E})$, where $\mathcal{V}$ is in one-to-one
correspondence with $R(d)$. We will denote by $V_G \in \mathcal{V}$ the vertex corresponding to $G \in R(d)$. There
are two types of edges, $\mathcal{E} = \mathcal{E}_2 \cup \mathcal{E}_3$: $(V_G, V_{G'}) \in \mathcal{E}_2$ if there is a 2-switch between $G$ and $G'$. Similarly,
$(V_G, V_{G'}) \in \mathcal{E}_3$ if there is a $C_3$ reorientation connecting them. We have the following result:

Theorem 3.1 (Rao et al. [?]) The meta-graph $\Omega_d$ is connected.
We can define a Markov chain random walk on $\Omega_d$ by an appropriate choice of probability weights
for each edge in $\mathcal{E}$. There are many choices for the weights, but for simplicity I will give as an example
the probability weights induced by a particularly simple random walk algorithm (see [?]). Given a
realization $G^{(n)} \in R(d)$, with probability $p$ attempt a 2-switch and with probability $1 - p$ a $C_3$
reorientation. For a 2-switch, choose four vertices without replacement and, if possible, perform
a 2-switch to arrive at $G^{(n+1)}$. Otherwise, do nothing, i.e. $G^{(n+1)} = G^{(n)}$. Similarly, for a $C_3$
reorientation choose three vertices without replacement and, if possible, perform a $C_3$ reorientation
to arrive at $G^{(n+1)}$. Otherwise, do nothing. The resulting probabilities for this Markov chain are
given by

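$$P_{ij} = \begin{cases} p / \binom{N}{4} & \text{if } (V_{G_i}, V_{G_j}) \in \mathcal{E}_2, \\ (1 - p) / \binom{N}{3} & \text{if } (V_{G_i}, V_{G_j}) \in \mathcal{E}_3. \end{cases}$$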

By doing nothing with failed move attempts, we impose self-loops at each realization such that
$\sum_j P_{ij} = 1$. By Theorem 3.1 this Markov chain is irreducible, and it is easily seen to be
symmetric and aperiodic. Thus, there is a unique limiting distribution which by symmetry must be
the uniform distribution.
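
A single step of the random walk just described might look as follows. This is a minimal sketch under stated assumptions: the realization is stored as a frozen set of arcs on vertices `0..n-1`, the name `markov_step` is hypothetical, and the drawn vertices are treated as an ordered tuple, which changes the transition probabilities by a constant factor but keeps the chain symmetric.

```python
import random
from typing import FrozenSet, Tuple

Arcs = FrozenSet[Tuple[int, int]]


def markov_step(arcs: Arcs, n: int, p: float, rng: random.Random) -> Arcs:
    """One step of the walk on realizations with vertices 0..n-1 (n >= 4)."""
    if rng.random() < p:
        # Attempt a 2-switch on four vertices drawn without replacement.
        a, b, c, d = rng.sample(range(n), 4)
        if {(a, b), (c, d)} <= arcs and not {(a, d), (c, b)} & arcs:
            return (arcs - {(a, b), (c, d)}) | {(a, d), (c, b)}
    else:
        # Attempt a C3 reorientation on three vertices drawn without replacement.
        u, v, w = rng.sample(range(n), 3)
        forward = {(u, v), (v, w), (w, u)}
        backward = {(v, u), (w, v), (u, w)}
        if forward <= arcs and not backward & arcs:
            return (arcs - forward) | backward
    return arcs  # failed attempt: stay put (a self-loop in the meta-graph)
```

Returning the unchanged arc set on a failed attempt is what realizes the self-loops mentioned above.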

It is mentioned in [?] that in most situations one need only use 2-switches, and thus we can
choose $p$ to be close to 1. The difficulty with this is that there are degree sequences where this will lead to
very long mixing times, due to the rare cases where there is no path with edges in $\mathcal{E}_2$ connecting
two realizations. The rarity of these cases is also unknown, and so there is no way to know how close
to 1 one should choose $p$. What is the structure of these degree sequences, and can we identify
them? It turns out we can identify them: they are a subset of what are known as $C_3$-anchored
degree sequences, as defined below.

Definition 3.2 We call a degree sequence $d$ $C_3$-anchored if it is forcibly $C_3$-digraphic and there
exists a nonempty set of coordinates $J$, called a $C_3$-anchor set, such that for every coordinate
$i \in J$ and every $G \in R(d)$, there is an induced subgraph $C' \subseteq G$ with $C' \cong C_3$ and $v_i \in V(C')$. All
realizations $G \in R(d)$ are also called $C_3$-anchored digraphs.

The structural characterization of $C_3$-anchored digraphs was given in [3] by a digraph
decomposition using $M$-partitions. An $M$-partition of a digraph $G$ is a partition of the vertex set $V(G)$
into $k$ disjoint classes $\{X_1, \dots, X_k\}$, where the arc constraints within and between classes are given
by a $k \times k$ matrix $M$ with elements in $\{0, 1, *\}$ (see [?]). $M_{ii}$ equals 0 or 1 when $X_i$
is an independent set or clique, respectively, and is set to $*$ when $G[X_i]$ is an arbitrary subgraph.
Similarly, for $i \neq j$, $M_{ij}$ equal to 0, 1, or $*$ corresponds to $G[X_i, X_j]$ having no arcs from $X_i$ to $X_j$,
all arcs from $X_i$ to $X_j$, and no constraints on arcs from $X_i$ to $X_j$, respectively.
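
The definition of an $M$-partition translates directly into a check on the arc set. The sketch below assumes arcs are stored as a set of ordered pairs and $M$ as a list of lists over the strings `'0'`, `'1'`, `'*'`; the function name `satisfies_m_partition` is hypothetical.

```python
from typing import Hashable, List, Set, Tuple

Arc = Tuple[Hashable, Hashable]


def satisfies_m_partition(
    arcs: Set[Arc], classes: List[Set[Hashable]], M: List[List[str]]
) -> bool:
    """Check that arcs from class i to class j obey M[i][j] in {'0', '1', '*'}."""
    for i, Xi in enumerate(classes):
        for j, Xj in enumerate(classes):
            if M[i][j] == "*":
                continue  # no constraint on arcs from Xi to Xj
            for x in Xi:
                for y in Xj:
                    if x == y:
                        continue  # simple digraphs have no self-loops
                    present = (x, y) in arcs
                    if present != (M[i][j] == "1"):
                        return False  # an arc is missing or forbidden
    return True
```

For instance, with classes `[{0}, {1}]` and `M = [["*", "1"], ["0", "*"]]`, the arc set `{(0, 1)}` passes, while adding the arc `(1, 0)` violates the `'0'` entry.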

The subset of $C_3$-anchored digraphs that are the focus of this paper are called $C_3^{*}$-anchored and
are realizations of $C_3$-anchored degree sequences such that $|J| = 3K$, where $K$ is a positive integer,
with $G[\{v_{j_{3n+1}}, v_{j_{3n+2}}, v_{j_{3n+3}}\}] \cong C_3$ for all realizations $G \in R(d)$, $0 \le n \le K - 1$. Note that $K$
denotes the number of anchored 3-cycles, i.e. those vertex triples that induce a directed 3-cycle in all
realizations. $C_3^{*}$-anchored digraphs have a structural characterization given by the following theorem
(see Fig. 1 for a pictorial representation):

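Theorem 3.3 (LaMar [4]) The digraph $G = (V, A)$ is a $C_3^{*}$-anchored digraph if and only if there
is a $C \subseteq V$ such that $G[C] \cong C_3$ and an $M$-partition of $G[V - C]$ with vertex classes given by
$\{C^{\pm}, C^{-}, C^{+}, C^{0}\}$, where each class defines how its elements relate to $C$ as follows:

$$\begin{aligned}
C^{0} &= \{x \in V - C : (x, C) \cup (C, x) \subseteq A^c\} \\
C^{-} &= \{x \in V - C : (x, C) \subseteq A \text{ and } (C, x) \subseteq A^c\} \\
C^{+} &= \{x \in V - C : (C, x) \subseteq A \text{ and } (x, C) \subseteq A^c\} \\
C^{\pm} &= \{x \in V - C : (x, C) \cup (C, x) \subseteq A\}
\end{aligned}$$

and matrix $M$ given by

$$\begin{array}{c|cccc}
 & C^{\pm} & C^{-} & C^{+} & C^{0} \\ \hline
C^{\pm} & 1 & * & 1 & * \\
C^{-} & 1 & * & 1 & * \\
C^{+} & * & 0 & * & 0 \\
C^{0} & * & 0 & * & 0
\end{array}$$

Here $(x, C)$ and $(C, x)$ denote the sets of all arcs from $x$ to the vertices of $C$ and from the vertices of $C$ to $x$,
respectively, and $A^c$ denotes the set of arcs not in $A$.

Figure 1: Left: The 4 vertex classes $\{C^{\pm}, C^{-}, C^{+}, C^{0}\}$ of $C_3^{*}$-anchored digraphs defined by how a
vertex $x$ in each class connects to a directed 3-cycle $C_3$ with vertex set $\{u, v, w\}$. Right: Diagram
showing the relations within and between the 4 possible vertex classes. Solid and dashed-dotted
arrows denote forced and allowable arcs, respectively, while the absence of an arrow denotes no arcs.
$G[C^{\pm}]$ is a clique, $G[C^{0}]$ an independent set, while $G[C^{-}]$ and $G[C^{+}]$ are arbitrary subgraphs.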
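
The four vertex classes of Theorem 3.3 depend only on which arcs join a vertex $x$ to the 3-cycle $C$, so membership can be tested directly. The following sketch uses the hypothetical name `classify_vertex` and the ASCII labels `"pm"`, `"minus"`, `"plus"`, `"zero"` for $C^{\pm}$, $C^{-}$, $C^{+}$, $C^{0}$; it returns `None` when $x$ fits none of the classes.

```python
from typing import Hashable, Iterable, Optional, Set, Tuple


def classify_vertex(
    arcs: Set[Tuple[Hashable, Hashable]], C: Iterable[Hashable], x: Hashable
) -> Optional[str]:
    """Return the class of x relative to the 3-cycle C, or None if x fits no class."""
    cycle = list(C)
    to_cycle = [(x, c) in arcs for c in cycle]    # arcs from x into C
    from_cycle = [(c, x) in arcs for c in cycle]  # arcs from C into x
    if all(to_cycle) and all(from_cycle):
        return "pm"     # all arcs in both directions
    if all(to_cycle) and not any(from_cycle):
        return "minus"  # x points at every vertex of C, nothing comes back
    if all(from_cycle) and not any(to_cycle):
        return "plus"   # every vertex of C points at x, nothing goes back
    if not any(to_cycle) and not any(from_cycle):
        return "zero"   # no arcs between x and C
    return None
```

A `None` result corresponds to a vertex that is joined to $C$ by only some of the possible arcs in one direction.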



If we define the meta-graph $\Omega_d^{2} = (\mathcal{V}, \mathcal{E}_2)$, then the following is the main theorem of this paper.

Theorem 3.4 $\Omega_d^{2}$ is disconnected if and only if $d$ is $C_3^{*}$-anchored.

Proof Let $d$ be $C_3^{*}$-anchored and $G \in R(d)$. Then $\Omega_d^{2}$ is disconnected: every
realization has a directed 3-cycle through the same three vertices, and thus no sequence of 2-switches can connect
realizations with opposite orientations of that 3-cycle without first removing the 3-cycle.

Suppose $d$ is a degree sequence with $\Omega_d^{2}$ disconnected. By Theorem 3.1 there must be a realization
$G \in R(d)$ with a directed 3-cycle $C$ such that $V_G$ is in one connected component of $\Omega_d^{2}$
and $V_{G'}$ is in another connected component, where $G'$ is found from $G$ by reorienting $C$. Let
$C = \{u, v, w\}$ with $\{(u, v), (v, w), (w, u)\} \subseteq A$. We want to show that any vertex $x \in V - C$
must be in one of the vertex classes $C^{0}$, $C^{-}$, $C^{+}$ or $C^{\pm}$. We will show that $(C, x) \subseteq A$ or $(C, x) \subseteq A^c$
(by symmetry, we will also have $(x, C) \subseteq A$ or $(x, C) \subseteq A^c$). Suppose exactly one of the three
arcs is present, and without loss of generality choose $(v, x) \in A$, with $\{(u, x), (w, x)\} \subseteq A^c$ (the case with two
existing arcs follows by considering the graph complement). The left panel in Fig. 2 shows that we
can perform a series of 2-switches to reorient the 3-cycle, which contradicts $G$ and $G'$ being in two
separate connected components of $\Omega_d^{2}$. Thus, we must have $(C, x) \subseteq A$ or $(C, x) \subseteq A^c$, showing $x \in C^{0}$,
$C^{-}$, $C^{+}$ or $C^{\pm}$.
Figure 2: Both left and right panels show a series of 2-switches used to reorient a directed 3-cycle
$C_3$ (see Theorem 3.4). The solid dots denote entries of the adjacency matrix which are not used in
the series of moves.



Now we must show that the connections between the vertex classes are given by the matrix $M$
in Theorem 3.3. Let $x \in C^{0} \cup C^{+}$ and $y \in C^{0} \cup C^{-}$, and suppose that $(x, y) \in A$. In the right
panel of Fig. 2 we see again that there is a series of 2-switches which reorients the 3-cycle, showing
$(x, y) \notin A$. By considering the graph complement, we can prove $(x, y) \in A$ for $x \in C^{\pm} \cup C^{-}$ and
$y \in C^{\pm} \cup C^{+}$. This shows $d$ is $C_3^{*}$-anchored, thereby completing the proof.

□ 

For every anchored 3-cycle $C$, there are two isomorphic copies of connected components of $\Omega_d^{2}$
corresponding to each orientation of $C$. In general, $\Omega_d^{2}$ thus has the following form.

Corollary 3.5

$$\Omega_d^{2} \cong \bigsqcup_{i=1}^{2^k} G_2,$$

where $G_2$ is one connected component of $\Omega_d^{2}$ and $k$ denotes the number of anchored 3-cycles.
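
For very small degree sequences, the component count in the corollary can be checked by brute force: enumerate every realization, join realizations that differ by a 2-switch, and count connected components. The sketch below is only an illustration with hypothetical function names; it enumerates all arc subsets and is therefore usable only for a handful of vertices.

```python
from itertools import combinations, permutations
from typing import FrozenSet, Iterator, List, Sequence, Tuple

Arcs = FrozenSet[Tuple[int, int]]


def realizations(d: Sequence[Tuple[int, int]]) -> List[Arcs]:
    """All simple digraphs on vertices 0..n-1 with (out, in) degree pairs d."""
    n = len(d)
    possible = list(permutations(range(n), 2))  # ordered pairs, no self-loops
    m = sum(out for out, _ in d)                # number of arcs in any realization
    found = []
    for chosen in combinations(possible, m):
        out_deg, in_deg = [0] * n, [0] * n
        for a, b in chosen:
            out_deg[a] += 1
            in_deg[b] += 1
        if all((out_deg[i], in_deg[i]) == tuple(d[i]) for i in range(n)):
            found.append(frozenset(chosen))
    return found


def two_switch_neighbors(arcs: Arcs) -> Iterator[Arcs]:
    """Realizations reachable from arcs by one 2-switch."""
    for (a, b), (c, e) in permutations(arcs, 2):
        if a != e and c != b and (a, e) not in arcs and (c, b) not in arcs:
            yield (arcs - {(a, b), (c, e)}) | {(a, e), (c, b)}


def count_two_switch_components(d: Sequence[Tuple[int, int]]) -> int:
    """Number of connected components of the meta-graph that uses 2-switches only."""
    unseen = set(realizations(d))
    components = 0
    while unseen:
        components += 1
        stack = [unseen.pop()]
        while stack:
            for nb in two_switch_neighbors(stack.pop()):
                if nb in unseen:
                    unseen.remove(nb)
                    stack.append(nb)
    return components
```

For the degree sequence of a single directed 3-cycle, `[(1, 1)] * 3`, the two orientations are the only realizations and no 2-switch exists, so the count is $2 = 2^1$. For `[(1, 1)] * 4` the nine realizations form a single component.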

The real power of this result is the knowledge that if we know where the anchored 3-cycles are,
then we can simply choose an orientation for each anchored 3-cycle uniformly at random, and then
perform a random walk on the graph $\Omega_d^{2}$. This will be an efficient procedure if the identification of
the anchored 3-cycles can be done without too much work. It was shown in [3] that $C_3^{*}$-anchored
digraphs have not only a structural characterization as given in Theorem 3.3 but also a degree-sequence
characterization. In other words, we can identify the anchored 3-cycles using a simple
procedure on the degree sequence itself. To this end, we start with some definitions.
Given an integer sequence $a$, define the corrected conjugate sequence $a''$ by

$$a''_k = |I_k| + |J_k|,$$

where

$$I_k = \{i \mid i < k \text{ and } a_i \ge k - 1\}, \qquad J_k = \{i \mid i > k \text{ and } a_i \ge k\}.$$
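
The corrected conjugate can be computed directly from its definition. The sketch below is a quadratic-time transcription kept close to the formula rather than an optimized routine; the function name `corrected_conjugate` is an assumption made here, and the input is a 0-based Python list holding the 1-based sequence $a_1, \dots, a_n$.

```python
from typing import List, Sequence


def corrected_conjugate(a: Sequence[int]) -> List[int]:
    """Corrected conjugate a'' of a, with a[i - 1] holding the coordinate a_i."""
    n = len(a)
    result = []
    for k in range(1, n + 1):  # k is the 1-based coordinate used in the text
        size_i_k = sum(1 for i in range(1, k) if a[i - 1] >= k - 1)
        size_j_k = sum(1 for i in range(k + 1, n + 1) if a[i - 1] >= k)
        result.append(size_i_k + size_j_k)
    return result
```

For example, `corrected_conjugate([1, 1, 1])` gives `[2, 1, 0]`.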

Definition 3.6 A degree sequence $d = \{(d_i^+, d_i^-)\}_{i=1}^N$ is non-increasing relative to the positive
lexicographical ordering if and only if $d_i^+ \ge d_{i+1}^+$, with $d_i^- \ge d_{i+1}^-$ when $d_i^+ = d_{i+1}^+$. In this case,
we will call $d$ positively ordered and denote the ordering by $d_i \ge d_{i+1}$. We say $d$ is non-increasing
relative to the negative lexicographical ordering by giving preference to the second coordinate,
calling $d$ in this case negatively ordered and denoting the ordering by $d_i \succeq d_{i+1}$.

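For a given degree sequence $d = \{(d_i^+, d_i^-)\}_{i=1}^N$, define the sequences $\bar{d} = \{(\bar{d}_i^+, \bar{d}_i^-)\}_{i=1}^N$ and
$\underline{d} = \{(\underline{d}_i^+, \underline{d}_i^-)\}_{i=1}^N$ to be the positive and negative orderings of $d$, respectively.
For a degree sequence $d$, define the slack sequences $\bar{s}$ and $\underline{s}$ by

$$\bar{s}_k = \sum_{i=1}^{k} [\bar{d}^-]''_i - \sum_{i=1}^{k} \bar{d}_i^+ \quad \text{with } \bar{s}_0 = 0,$$

$$\underline{s}_k = \sum_{i=1}^{k} [\underline{d}^+]''_i - \sum_{i=1}^{k} \underline{d}_i^- \quad \text{with } \underline{s}_0 = 0.$$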

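Theorem 3.7 (LaMar [4]) The degree sequence $d = \{(d_i^+, d_i^-)\}_{i=1}^N$ is $C_3^{*}$-anchored if and only if
there are coordinates $\{j_1, j_2, j_3\}$ and an integer pair $(k, l) \ge (1, 1)$ such that

$$d_{j_1} = d_{j_2} = d_{j_3} = (k, l) \tag{1}$$

with

$$(d_{j_1}, d_{j_2}, d_{j_3}) = (\bar{d}_l, \bar{d}_{l+1}, \bar{d}_{l+2}) = (\underline{d}_k, \underline{d}_{k+1}, \underline{d}_{k+2}) \tag{2}$$

and the slack sequences satisfying

$$(0, 1, 1, 0) = (\bar{s}_{l-1}, \bar{s}_l, \bar{s}_{l+1}, \bar{s}_{l+2}) = (\underline{s}_{k-1}, \underline{s}_k, \underline{s}_{k+1}, \underline{s}_{k+2}). \tag{3}$$

In this case, $\{v_{j_1}, v_{j_2}, v_{j_3}\}$ induces an anchored 3-cycle.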
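
One way the three conditions of Theorem 3.7 might be checked on a concrete degree sequence is sketched below. Several points are assumptions of this sketch rather than statements from the paper: the function names are hypothetical, the slack sequences are taken to be partial sums of a corrected conjugate minus partial sums of the ordering's leading coordinate, ties in the orderings are broken by the original index, and the anchored vertices are read off from the positive ordering. The code favors clarity over the linear running time mentioned in the introduction.

```python
from typing import List, Sequence, Tuple


def corrected_conjugate(a: Sequence[int]) -> List[int]:
    n = len(a)
    return [
        sum(1 for i in range(1, k) if a[i - 1] >= k - 1)
        + sum(1 for i in range(k + 1, n + 1) if a[i - 1] >= k)
        for k in range(1, n + 1)
    ]


def slack(conjugated: Sequence[int], summed: Sequence[int]) -> List[int]:
    """s_0 = 0 and s_k = sum of conjugated''_i minus sum of summed_i over i <= k."""
    s = [0]
    for c, x in zip(corrected_conjugate(conjugated), summed):
        s.append(s[-1] + c - x)
    return s


def anchored_3cycles(d: Sequence[Tuple[int, int]]) -> List[Tuple[int, ...]]:
    """0-based vertex triples satisfying conditions (1)-(3) for (out, in) pairs d."""
    n = len(d)
    pos = sorted(range(n), key=lambda i: (-d[i][0], -d[i][1]))  # positive ordering
    neg = sorted(range(n), key=lambda i: (-d[i][1], -d[i][0]))  # negative ordering
    s_pos = slack([d[i][1] for i in pos], [d[i][0] for i in pos])
    s_neg = slack([d[i][0] for i in neg], [d[i][1] for i in neg])
    anchors = []
    for l in range(1, n - 1):  # 1-based position of the triple in the positive ordering
        k = d[pos[l - 1]][0]
        if k < 1 or k + 2 > n:
            continue
        if any(tuple(d[i]) != (k, l) for i in pos[l - 1:l + 2]):
            continue  # conditions (1) and (2), positive ordering
        if any(tuple(d[i]) != (k, l) for i in neg[k - 1:k + 2]):
            continue  # condition (2), negative ordering
        if s_pos[l - 1:l + 3] == [0, 1, 1, 0] and s_neg[k - 1:k + 3] == [0, 1, 1, 0]:
            anchors.append(tuple(sorted(pos[l - 1:l + 2])))  # condition (3)
    return anchors
```

On the degree sequence of a single directed 3-cycle, `[(1, 1)] * 3`, this returns `[(0, 1, 2)]`, while on `[(1, 1)] * 4` the slack condition fails and the result is empty.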

Algorithm 3.8 To achieve a uniformly sampled simple realization of a degree sequence $d$, we check
if $d$ is $C_3^{*}$-anchored and identify the anchored 3-cycles using Theorem 3.7, randomly assign an
orientation to each anchored 3-cycle with equal probability (which effectively chooses a connected
component of $\Omega_d^{2}$), and then perform a random walk on this component of $\Omega_d^{2}$ using only 2-switches.
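
The last two steps of Algorithm 3.8 can be sketched as follows, assuming a starting realization and the list of anchored vertex triples are already available (how they are obtained is outside this sketch). The function name `sample_realization`, the fixed number of steps, and the ordered draw of four vertices are all assumptions made for illustration; the paper does not specify how long the walk must run.

```python
import random
from typing import FrozenSet, Iterable, Sequence, Tuple

Arc = Tuple[int, int]


def sample_realization(
    arcs: Iterable[Arc],
    n: int,
    anchors: Sequence[Tuple[int, int, int]],
    steps: int,
    rng: random.Random,
) -> FrozenSet[Arc]:
    """Pick an orientation for each anchored 3-cycle, then walk with 2-switches only."""
    current = set(arcs)
    # Reverse each anchored 3-cycle with probability 1/2.
    for triple in anchors:
        if rng.random() < 0.5:
            inside = {(a, b) for a in triple for b in triple if (a, b) in current}
            current -= inside
            current |= {(b, a) for a, b in inside}
    # Random walk using only 2-switches; a failed attempt leaves the state unchanged.
    if n >= 4:
        for _ in range(steps):
            a, b, c, d = rng.sample(range(n), 4)
            if (a, b) in current and (c, d) in current \
                    and (a, d) not in current and (c, b) not in current:
                current -= {(a, b), (c, d)}
                current |= {(a, d), (c, b)}
    return frozenset(current)
```

Because the orientation of every anchored 3-cycle is fixed before the walk starts, the walk never needs a $C_3$ reorientation to reach the rest of its component.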
4 Conclusion



We have shown that the degree sequences that require both types of moves, i.e. 2-switches
and directed 3-cycle reorientations, are the $C_3^{*}$-anchored degree sequences, which
have been characterized in [?]. This characterization allows for a fast algorithm that identifies the
$C_3$-anchor sets, leading to a Monte Carlo algorithm involving only 2-switches.

This is the second instance where $C_3$-anchored degree sequences have been found to be the special
cases in algorithms involving directed graphs (for the first case, see [?]). It would be interesting to see where
else this structure may be important.